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ADAPTIVE SCHEDULING OF DATA DELIVERY IN A CENTRAL SERVER 

5 Cross-Reference to Related Application 

This application is a continuation of pending U.S. Application No. 09/336,981, filed June 
21, 1999, which is incorporated by reference herein. 

Field of the Invention 

10 This invention relates generally to communication systems, and more particularly to a 

centralized scheduling arrangement of job requests in a multiple-channel, point-to-point system. 

Background of the Invention 

The boom in the hitemet and the rise of new network technologies have focused attention 
15 on designing faster and more efficient data networks. A key component of the data network is the 
data server. Data servers are the engines that store and feed content to diverse clients over the 
network media. Data servers can take many forms, such as infestations, wireless gateways, web 
servers or specialized servers such as traffic or weather information servers. Increasingly, data 
servers employ a multiple-channel, point-to-point model. 
20 For example. Figure 1 illustrates a typical multiple-channel communication system of the 

prior art, which satisfies job requests in a point-to-point, or unicast, fashion. In a point-to-point 
system, each client*s request for data is required to be individually satisfied, even if more than one 
client makes a job request for the same data item. In this regard, a point-to point system is different 
fi-om a broadcast system, which is used to advantage by satisfying all of the pending job requests for 
25 a data item with a single broadcast of the data item. One of many examples of a point-to-point 
system is a centralized Internet server which provides web page data in response to requests firom 
different Internet users. 

In Figure 1, devices 30a through 30w make requests for data which are delivered to central 
server 10 via link 36, or by any other method. Specifically, device 30a makes a job request for data 
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item a, device 30b makes a job request for data item b, and device 30w makes a job request for data 
item w. Central server 10 receives the job requests and retrieves the requested data from either an 
internal database or an external database (as will be shown below). Central server 10 stores the job 
requests in queue 12 and then transmits the requested data to one of a plurality of local channel 

5 servers, designated as 40- 1 through 40-k, Each local channel server provides data to a corresponding 
data chaimel designated Ci through Cj,. When requested data is received by one of the local channel 
servers, the local channel server then services the job by transmitting the requested data to the 
requesting device via its corresponding channel. Channel servers may employ signal processors that 
modulate the data to appropriate channels, such as in many proposed wireless base stations. 

10 Additionally, a scheduling arrangement is employed by the system. The scheduling 

arrangement determines the order in which the job requests are serviced. In the prior art, one method 
which is typically employed for this purpose is a centralized scheduling method. According to a 
centralized scheduling method, the central server has a corresponding queue, such as queue 12 
shown in Figure 1 . When a job request is received by the central server, the central server stores the 

15 data in its corresponding queue along with other pending job requests which have not yet been 
serviced. Local channel servers service the job requests which are sent to them by the central server. 
When a local channel server completes a job requests, it sends a signal to the central server. The 
signal instructs the central server to send the next pending job request for transmission via the local 
channel server's corresponding channel. 

20 According to a centralized scheduling scheme, the central server employs a scheduling 

algorithm in order to determine which of the jobs pending in its corresponding queue are to be sent 
to a local channel server so as to be serviced first. One scheduling algorithm which is typically 
employed by a central server in a communication system employing a centralized scheduling method 
is a "roimd-robin" arrangement. Basically, the central server assigns data relating to a job request 

25 to the next available local channel server based on a first-in first-out scheme. However, this 
arrangement may be imsatisfactory, specifically with widely heterogeneous data requests 
(heterogeneous job requests are job requests having varying sizes). 
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Another scheduling algorithm employed by data servers is known as the "Shortest Remaining 
Processing Time" (hereinafter referred to as "SRPT*') algorithm. The SRPT algorithm produces a 
schedule which minimizes the time it takes to process' all of the uncompleted jobs in a queue. The 
SRPT algorithm is typically employed when jobs arrive in a continuous stream, and is based upon 
5 a sum-flow metric. The relevant parameter of the sum-flow metric is the time a job spends in the 
system. The SRPT algorithm employs the sum-flow metric by summing the time that all jobs spend 
in the system, and schedules the pending jobs so as to minimize this summed amount of time. 

However, the SRPT algorithm has the drawback that it leads to starvation. Starvation occurs 
when some job request to the server is delayed to an unbounded extent. For instance, starvation may 

1 0 occur when the servicing of a pending job request is continually delayed because the SRPT algorithm 
determines that incoming job requests are serviced prior to the pending job request. Although the 
SRPT algorithm can be desirable in some circumstances, the fact that specific job requests are 
delayed to an unbounded extent is unfair to the person who made the request which is delayed. 
Furthermore, the fact that some job requests are delayed to an unbounded extent prevents the owner 

15 of the server system from being able to make a quality-of-service guarantee to each user that the 
schedule will be responsive to each job and avoid starvation of any job. 

Although the above-referenced scheduling methods have attempted to address data server 
performance issues, there is an ever increasing need for improved scheduling methods that provide 
satisfactory performance. Therefore, there exists a need for a system and method for optimally 

20 scheduling the transmission of data items in multiple-channel, point-to-point communication system. 

Summary of the Invention 

In accordance with one embodiment of the invention, a system and method for scheduling 
responses in a multiple-channel, point-to-point communication system are disclosed. The method 
25 comprises the steps of receiving at a central server a plurality of job requests, and detemining an 
adaptive schedule for servicing the job requests via the multiple channels. In a departure from prior 
art scheduling methods for multiple-channel, point-to-point communication systems, wherein a 
response to a job request is transmitted entirely before a following response is transmitted, the 
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schedule in accordance with the present invention is adaptively determined such that the servicing 
of a first job request via a first channel may be interrupted and serviced via a second channel upon 
receipt of a new job request at the central server, so as to service a second job request in accordance 
with an updated schedule. 

5 According to one embodiment, the central server updates the schedule upon the arrival of 

each new job request. Each time the schedule is updated by the central data server, the data 
corresponding to the job requests that need to be served based on the updated schedule is sent to 
corresponding local channel servers. In one embodiment, the first k job requests are scheduled, 
where k is the number of available local channel servers that can service the job request. 

10 In accordance with another embodiment of the invention, a central server of a 

multiple-channel, point-to-point communication system computes a feasible stretch value for use in 
scheduling heterogeneous job requests to be serviced. A stretch value provides an indication of the 
delay experienced by each job request to complete, when the server processes many job requests 
concurrently. In one embodiment, for a set of job requests to be serviced in an off-line setting, a 

1 5 processing time is calculated for each job request based on the size of the job and the transmission 
speed or bandwidth of the channels. Thereafter, a stretch value is proposed and the central server 
computes a deadline for each job to be the arrival time of the job request plus the product of the 
processing time and the proposed stretch value. Thereafter, each job request is scheduled, based on 
an "earliest deadline first" (EDF) arrangement, wherein the job request that has the earUest deadline 

20 is scheduled first, the job request having the next earliest deadline is scheduled second, etc. 

The proposed stretch value is deemed feasible if each pending job request can be completed, 
if performed in the order determined by the earliest deadline first methodology, prior to its deadline. 
If the proposed stretch value is deemed not feasible, it is adjusted iteratively until a feasible stretch 
value is found. Once the feasible stretch value is found, the job requests are serviced in the order 

25 dictated by the EDF schedule as found in the latest iteration. According to one embodiment, the 
feasible stretch value is fiirther adjusted until an optimal feasible stretch value is found, which equals 
the smallest feasible stretch value which permits, when serviced in the order dictated by the EDF 
schedule as found in the current iteration, all pending jobs to be completed prior to their deadline. 
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In another embodiment, job requests are serviced by a central server in an on-line setting, in 
which the system services pending job requests prior to determining whether a modified stretch is 
feasible. In accordance with another embodiment of the invention, the system determmes a schedule 
for a set of pending job requests as a function of all of the job requests from the arrival of the first 
5 job request to the system up to the arrival of the current job. In accordance with yet another 
embodiment of the invention, the system determines a schedule for a set of pending job requests as 
a function of the last n number of job arrivals, wherein n is an integer. 

Brief Description of the Drawings 

10 The present invention will be further understood from the following description with 

reference to the accompanying drawings. 

Figure 1 illustrates a multiple-channel, point-to-point communication system, as employed 
in the prior art; 

Figure 2 illustrates the components of a central server, in accordance with one embodiment 
15 of the present invention; 

Figure 3 is a flow chart which illustrates the steps of scheduling and processing a plurality 
of job requests in the off-line setting, in accordance with one embodiment of the present invention; 

Figure 4 is a flow chart which illustrates the steps of scheduling and processing a plurality 
of job requests in the on-line setting, in accordance with one embodiment of the present invention; 
20 Figure 5 is a table which illustrates the performance of an algorithm referred to as the OPT 

algorithm, in accordance with one embodiment of the invention; and 

Figure 6 is a table which illustrates the performance of the SRPT, MAX and BASE 
algorithms, in accordance with one embodiment of the invention. 

25 Detailed Description of the Invention 

In accordance with one embodiment, the present invention comprises a system and method 
for centrally scheduling the servicing of job requests for data items in a multiple-chaimel, 
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point-to-point communication system. For instance, the requests for data can be made via wireless 
interface, over phone lines, by computer interface, etc. 

As previously discussed. Figure 1 illustrates a typical multiple-channel, point-to-point 
communication system, which employs a centralized scheduling method. Devices 30a through 30w 
5 make requests for data which are delivered to central server 1 0 by any method. Specifically, device 
30a makes a job request for data item a, device 30b makes a job request for data item b, and device 
30w makes a job request for data item w. Central server 10 receives the job requests and retrieves 
the requested data from either an internal database or an external database. Central server 10 then 
transmits the requested data to one of the plurality of local channel servers 40- 1 through 40-k, which 
1 0 then services the job by transmitting the requested data to the requesting device via its corresponding 
channel. 

Central server 10 of the system employs a centralized scheduling arrangement to determine 
the order in which the job requests are serviced. According to a centralized scheduling method, 
central server has corresponding queue 12. When a job request is received by central server 10, 

1 5 central server 10 stores the data in queue 12 along with other pending job requests which have not 
yet been serviced. Local chaimel servers service the job requests which are sent to them by central 
server 10. When a local channel server completes a job request, it sends a signal to central server 
10, The signal instructs central server 10 to send to the available local channel server the next 
pending job request for transmission via the local channel server's corresponding channel. Central 

20 server 10 employs an adaptive scheduling algorithm in order to determine which of the jobs pending 
in its corresponding queue are to be sent to a local channel server so as to be serviced first. 
According to one embodiment of the present invention, central server 10 employs an off-line 
scheduling algorithm referred to as a BASE algorithm. According to another embodiment of the 
present invention, central server 10 employs an on-line scheduling algorithm referred to as a MAX 

25 algorithm. 

Figure 2 illustrates the features of a typical central server 10 that employs various 
embodiments of the present invention. The scheduling arrangement of the present invention may 
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be employed in any central server system that is configured to receive a job request firom a plurality 
of clients. 

In Figure 2, central server 1 0 is configured to transmit data items via links 34- 1 through 34-k 
to a plurality of local channel servers/channels in response to requests received firom clients via link 
5 36. As shown in Figure 1, these requests may take the form of requests for data items a, b and w by 
client devices 30a, 30b and 30w, respectively. According to one embodiment, central server 10 
includes a plurality of storage devices such as 25-27, which are configured to provide the plurality 
of data items upon request. 

Central server 10 includes, in one embodiment, an admission control circuit 21, which is 

1 0 configured to analyze an incoming request and determine whether the request will be acknowledged 
or denied. The request might be denied for any number of reasons, e.g., the person making the 
request is not authorized to make the request, the request is incompatible with the server system, etc. 
An input port of admission control circuit 21 is coupled to link 36, which is configured to deUver 
the job requests. Admission control circuit 2 1 receives and stores for each job request, among other 

15 things, the time the request arrives, the length of the request, the data item requested, and the 
bandwidth or data rate necessary to serve the request. 

In accordance with one embodiment of the invention, controller 22 is a microprocessor. An 
input port of controller 22 is coupled to an output port of admission control circuit 2 1 . Additionally, 
an output port of controller 22 is coupled to an input port of admission control circuit 21 . Controller 

20 22 employs the scheduling arrangement described herein, by performing the calculations and 
determining which job requests to transmit to a local channel server for servicing via the local 
channel server's corresponding charmeL Pending job requests are stored in a memory buffer such 
as queue 12. Storage devices 25-27 are configured to provide data to an internal bus system 24 in 
response to signals provided by controller 22, although the invention is not limited in scope in that 

25 respect. Data retrieved firom storage devices 25-27 is provided to the local channel servers via 
interface unit 32 and links 34-1 through 34-k. 

It should also be noted that, although one embodiment of the invention is disclosed in the 
context of discrete fimctioning elements such as an admission control circuit and a controller, it is 
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understood that different discrete functioning elements or software could be employed to carry out 
the present invention. 

In one embodiment of the invention, admission control circuit 2 1 has prior knowledge of the 
arrival time of job requests to be scheduled by controller 22. A scheduling system based on such an 
5 arrangement is referred to as "off-line" scheduling. Thus, in an off-line scheduling arrangement, the 
system has knowledge of all the jobs that need to be serviced, and can determine the optimal 
schedule for the job requests prior to starting to service the job requests. The steps conducted by one 
enibodiment of the present invention, which employs an off-line scheduling arrangement, are shown 
and described in Figure 3. It is noted that the "off-line" scheduling arrangement in accordance with 

10 the present invention leads to an optimal scheduling benchmark against which other scheduling 
arrangements may be compared. 

In another embodiment of the invention, admission control circuit 2 1 does not store all of the 
job requests before directing them to controller 22 or have prior knowledge of the arrival times of 
future job requests. A scheduling system based on such an arrangement is referred to as "on-line" 

1 5 scheduling. Unlike an off-line system, in which a series of calculations to find the optimal schedule 
is performed prior to actually servicing the job requests, an on-line system is one in which the 
controller determines a schedule, based upon job requests which have previously been processed, 
while simultaneously processing the currently pending job requests. The steps conducted by one 
embodiment of the present invention, which employs an on-line scheduling arrangement, are shown 

20 and described in Figure 4. 

Figure 3 is a flow chart which illustrates the steps of an off-line scheduling arrangement 
which schedules the transmission of a plurality of data items in response to job requests, referred to 
as the BASE scheduling method. At step 100, the arrival time, A^, of a job request is stored by 
central server 10. The arrival time of the job request is the point in time when central server 10 

25 receives the job request. 

At step 102, central server 10 calculates a processing time, P|, for the job request. The 
processing time for a job request is the amount of time it would take for central server 10 to complete 
the job request if central server 10 could service the job request immediately and without 
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interruption, i.e., without the job request waiting for other jobs to finish before starting, and without 
the service of the job request being preempted by central server 10 to service other requests. In the 
preferred embodiment, central server 10 calculates the processing time as a function of the size of 
the data items, such as by dividing the size (i.e., in bytes) of the data item requested by the 
5 transmission speed of the channels (e.g., in bytes per second). 

At step 104, central server 10 proposes a stretch value, S, to utilize in scheduling the job 
requests. A stretch value provides an indication of the delay experienced by each job request to 
complete, when the server processes many job requests concurrently. Specifically, a stretch value 
exists because, in the preferred embodiment of the invention, central server 1 0 employs preemption. 

10 Preemption is a scheduling technique in which a server alternates between a plurality of different 
pending job requests, servicing portions of each job request until eventually all of the job requests 
are complete. Thus, the stretch value is defined herein as a ratio equal to the time that central server 
10 requires to service a particular job request while also serving other uncompleted job requests in 
the system, divided by the time that central server 10 would require to service that particular job 

15 request if no other job requests were required to be serviced. It should be noted though that, 
although a stretch value can exist for an individual job request, the stretch value is preferably not 
calculated separately for each individual job request but is instead proposed as a first step for 
application to the entire set of pending job requests, as will be shown in step 106 below. Initially, 
the proposed stretch value can be a random or arbitrarily chosen number or can be based on 

20 previously found stretch values. 

Once step 104 is satisfied and a stretch value has been proposed, central server 10 proceeds 
to step 106. At step 106, central server 10 uses the proposed stretch value to calculate the deadlines, 
Dj, for all of the uncompleted job requests in the central server queue remaining to be serviced. For 
an uncompleted job request /, the deadline, Dj, is equal to the arrival time of that job, A^, plus the 

25 product of the proposed stretch value, S, times the processing time, Pj, such that: 

Di = SxPi + Ai^ 



9 



Acharya 18-18-18 

At step 108, once the deadline for each uncompleted job is calculated, central server 10 
schedules the jobs, in one embodiment of the present invention, according to an earliest deadline first 
("EDF") methodology. According to the EDF methodology, central server 1 0 initially schedules for 
service, via k local channel servers, the A job requests which have the earliest deadlines, as found in 
5 step 106, relative to all of the other jobs. As local channel servers complete the responses to their 
corresponding job requests, central server 10 then selects the next available job request with the 
earliest deadline and schedules it to be serviced by the available local channel servers. This process 
continues until all of the job requests have been scheduled. 

At decision step 1 1 0, central server 10 inquires whether, if serviced according to the schedule 
1 0 determined by the EDF methodology, each and every one of the job requests will be completed prior 
to its respective deadhne as calculated in step 106. If any job request is not able to be completed 
prior to its respective deadline, then the proposed stretch value is not feasible and the job requests 
are not serviced according to the schedule determined by the EDF methodology in step 108 of the 
current iteration. 

1 5 Instead, central server 1 0 preferably increases the proposed stretch value at step 112. From 

step 1 12, central server 10 returns to step 106. Central server 10 re-performs steps 106 to 1 10 in 
order to check the feasibility of the new proposed stretch value. Central server 1 0 repeats these steps 
iteratively, progressively increasing the stretch value, until a feasible stretch value is found which 
enables all of the job requests, when scheduled in step 108 according to the EDF methodology, to 

20 be completed within their respective deadlines (as calculated in step 106). 

When a feasible stretch value is found, central server 10 proceeds to step 114, where it 
services all of the job requests in the order dictated by the EDF schedule, as found in step 108 of the 
current iteration. Specifically, central server 10 services, via the k local channel servers, the kjoh 
requests which have the earliest deadlines. Central server 1 0 then services the following job requests 

25 with the earliest deadlines via the first available local channel servers that complete servicing their 
job requests. 

In accordance with one embodiment of the invention, the schedule determined by controller 
22 incorporates migration between available local chaimel servers. For instance, if a job request for 

10 



Achaiya 18-18-18 

a first data item is currently being serviced by a first channel, an un-serviced portion of the data item 
may be returned firom that local channel server to the central server to be serviced by a second 
channel, in order that the first channel instead services a subsequently arriving job request for a 
second data item. This may occur when the second data item is relatively small, and the job 

5 requesting it has an earlier deadline than the job request for the larger first data item. This migration 
scheduling arrangement prevents the delay in servicing the small second data item firom adversely 
affecting the performance of the schedule. 

In one embodiment of the invention, the adjustment at step 1 12 (of the flowchart in Figure 
3) to the proposed stretch value is accompUshed by doubling the previously proposed stretch value. 

10 In yet another embodiment, once the system determines that a feasible stretch value falls between 
two integers, k and 2k, the feasible stretch value is adjusted fiirther by performing a binary search 
for an optimal feasible stretch value. The optimal feasible stretch value is the smallest feasible 
stretch value that enables all of the job requests, when scheduled in step 108 according to the EDF 
methodology, to be completed within their respective deadlines (as calculated in step 106). 

1 5 This off-line scheduling arrangement, in which server 1 0 calculates an adaptive schedule for 

a known set of pending job requests by utilizing the EDF methodology, is referred to as the BASE 
method. Because the set of pending job requests is known, the BASE method is able to generate a 
near optimal schedule to serve the job requests. However, the BASE method is not always desirable 
to use. For instance, while it is possible to utilize the BASE method in an on-line setting, it may not 

20 be practical to do so. This follows because the BASE scheduling method requires the controller to 
determine a schedule based on all incoming job requests, which requires large amounts of memory 
and processing power. Instead, in the on-line setting, prior stretch values, or modified stretch values 
are utilized, as will be discussed below. 

Figure 4 is a flow chart which illustrates the steps of an on-line scheduling arrangement 

25 which adaptively schedules the transmission of a plurality of data items in response to a continuoxis 
stream of job requests, referred to as the MAX scheduling method. As previously mentioned, an 
on-line scheduling arrangement is one which must determine how to schedule the job requests based 
upon job requests that have already been received. Thus, a server system which employs an on-line 

11 



Achaiya 18-18-18 

scheduling arrangement begins servicing the job requests before determining an optimal stretch 
value. 

At step 1 20 of the flow chart, a new job request arrives at central server 1 0. Typically in the 
on-line setting, central server 10 will currently be servicing a plurality of pending job requests when 
5 the new request arrives. At step 122, the arrival time, A^, of the newly arrived job request is stored 
by central server 10. 

At step 124, central server 10 calculates and stores a processing time, P„ of the arrived job 
request. As shown and described in reference with Figure 3, the processing time for a job request 
is the amount of time it would take for central server 10 to complete the job request if central server 

10 10 could service the job request immediately and without interruption, i.e., without the job request 
waiting for other jobs to finish before starting, and without the service of the job request being 
preempted by central server 10 to service other requests. As previously indicated, in the preferred 
embodiment, central server 10 calculates the processing time as a function of the size of the data 
items, such as by dividing the size (i.e., in bytes) of the data item requested by the transmission speed 

15 of the channels (e.g. , in bytes per second) . 

In accordance with one embodiment of the invention, instead of considering all of the job 
requests arrived to the server, the on-line scheduling arrangement considers the job requests that 
have arrived during a history window, defined by a history window parameter. A history window 
parameter comprises a specified set of job requests having n number of job requests, where w is an 

20 integer. The value of n is advantageously chosen by the operator of server system 10. While it is 
possible that n equals the total number of jobs serviced by the system, it may be desirable to avoid 
storing or calculating arrival times and processing times for the entire history at every arrival. 
Therefore, in one embodiment, « is a finite number in the range of 10 to 100, corresponding to the 
last 10 to 100 jobs which were serviced by the system immediately proceeding the arrival of the 

25 presently arrived job. Not only does the selection of an n value in this range control the cost of 
scheduling, but also prevents a momentary surge of arrivals a long time ago fi'om inducing the 
controller to compute a large processing time ratio ever after. 

12 
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At step 126, central server 10 determines a modified stretch value, S*. The modified stretch 
value S* is the maximum stretch value for a completed job request found during a specified history 
window parameter. At step 128, central server 10 uses the modified stretch value, S*, to calculate 
the deadlines, D, for all of the uncompleted job requests remaining to be serviced. In accordance 
5 with one embodiment, this includes the job requests that are currently being serviced via each of the 
local channel servers and the job requests that are stored in queue 12 waiting to be serviced. For 
each job request /, the deadline, Dj, is equal to the arrival time of that job request, A^, plus the 
product of the modified stretch value, S*, times the processing time, P^, such that: 

10 DrS*xPi+Ai. 

At step 130, in accordance with one embodiment of the present invention, central server 10 
determines a schedule for the job requests in accordance with an EDF methodology. According to 
the EDF methodology, central server 10 initially schedules for service, by providing the k local 

15 channel servers, the A: job requests which have the earliest deadlines. Central server 10 then selects 
the next job requests with the earliest deadlines and schedules them to be serviced by the first 
available local channel servers that complete servicing their job requests. 

At step 132, central server 10 transmits the ^ job requests having the earliest deadlines, 
according to the EDF schedule determined in step 1 30, to the k local channel servers for transmission 

20 to the clients that requested the data item. At step 134, when a job request has been completely 
serviced via a particular channel, the local channel server that corresponds to the channel sends a 
signal to central server 10. The signal indicates to the central server that the job request is complete 
and that the channel is now available to service another job request. In addition, according to one 
embodiment, upon receiving the signal, central server 10 determines the completion time of the job 

25 request. 

At step 136, central server 10 determines if, at the modified stretch value, S*, the completed 
job request was completed prior to its respective deadline. If so, the modified stretch value is 
feasible and the system proceeds to step 140, where central server 10 maintains the current modified 
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stretch value. At step 142, central server 10 then transmits to the available channel the job request 
in its queue that has the next earliest deadline. The method then returns to step 134, which is 
performed when the next job request is completed via one of the channels. 

If, using the modified stretch value, S*, the completed job request is not completed prior to 
its corresponding deadline, then the modified stretch value is not feasible. The method then proceeds 
to step 138, and the modified stretch value is adjusted. After step 138, the method then returns to 
step 128, and the method repeats, i.e., central server 10 recalculates the deadlines for all of the job 
requests, reschedules the job requests according to the EDF methodology, and transmits the A: job 
requests having the earliest deadlines to the k local channel servers to be serviced via the 
corresponding channels. However, in accordance alternative embodiment of the invention, the 
deadlines calculated in step 128 remain constant even after the modified stretch value is adjusted at 
step 138, and the adjusted modified stretch value is used to calculate deadlines for the newly arriving 
job requests only. 

As may be the case, a subsequent job request will arrive at central server 10 to be serviced 
while each of the channels are processing the existing job requests. If this is the case, then central 
server 10 returns to step 120 of the flow chart to re-perform steps 120 through 140. Thus, the EDF 
schedule of pending job requests determined by central server 10 is updated to include the newly 
arrived job request. 

As previously discussed, in accordance with one embodiment of the invention, the schedule 
determined by controller 22 of central server 1 0 incorporates migration. For instance, if a job request 
for a first data item is currently being serviced by a first channel, an un-serviced portion of the data 
item may be returned fi'om that local channel server to the central server to be serviced by a second 
channel, in order that the first channel instead service a subsequently arriving job request for a 
second data item. This may occur when the second data item is relatively small, and the job 
requesting it has an earlier deadline than the job request for the larger first data item. This migration 
scheduling arrangement prevents the delay in servicing the small second data item from adversely 
affecting the overall performance of the schedule. 
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Each of the methods discussed above, which determine a modified stretch value, utiHze the 
EDF methodology to schedule the job requests. It is possible, however, to use other scheduhng 
methodologies. For instance, the SRPT method, which schedules the pending job requests according 
to the remaining time to process each request, can be utilized. 
5 As previously mentioned, the present invention seeks to find a schedule that balances the 

conflicting requirements of the individual performance for each job request with the global 
performance of the system, such that the system is responsive to each job request and avoids 
starvation of any job request. In other words, the optimal schedule balances the need to minimize 
the worst case completion time of a job request and the need to minimize the average response time 

1 0 for the job requests. In addition, the optimal schedule balances the need to minimize the worst case 
stretch of a job request and the need to mmimize the average stretch values for all of the job requests. 

In order to measure the effectiveness of the BASE and MAX algorithms, their performance 
can be compared to algorithms which are known to be near-optimal with respect to certain metrics. 
For instance, a metric which is employed in order to minimize the maximum stretch value 

1 5 (hereinafter ''MaxStr'') experienced when employing multiple channels to service a plurality of job 
requests is referred to as OPT. The OPT algorithm is an off-line algorithm similar to BASE, except 
that unlike the BASE algorithm, the OPT algorithm calculates and schedules pending job requests 
so as to minimize the MaxStr at specific time intervals, by designating that each user may have only 
one outstanding job request at any time. 

20 Figure 5 illustrates some of the performance characteristics of the OPT algorithm in a model 

of the multi-channel environment. The performance was determined for a model in which a total 
downlink bandwidth of 128 Kbits/second was employed, divided between a number of chaimels 
ranging firom 5 to 20 channels (thus rendering the bandwidth of each channel within a range between 
6.4 and 25.6 Kbits/second). For instance, for a given set of job requests, the minimum MaxStr 

25 obtained by the OPT algorithm is 23,2, while the average stretch of all of the job requests was 9.8 
and the average response time for all of the jobs was 12.9 seconds. 

Figure 6 illustrates the performance of the SRPT, MAX and BASE algorithms as employed 
by a central server that schedules job requests for service via multiple channels, in accordance with 

15 



Acharya 18-18-18 

the model described above. As shown, the MAX algorithm has the lowest MaxStr of all of these 
algorithms, at 23.98, which is only slightly higher than the OPT algorithm (which is known to 
minimize the MaxStr value). In addition, the MAX algorithm has the lowest maximum response 
time of the three algorithms, at 11,775 seconds, while having slightly higher, but still fairly low, 
average stretch and average response time values, at 1 .3 and 5.2 seconds, respectively. By contrast, 
the SRPT algorithm of the prior art has a significantly higher MaxStr value at 34.4. 

Unexpectedly, the MAX algorithm also has a lower MaxStr than the BASE algorithm (the 
BASE algorithm has a MaxStr of 24.09), even though the BASE algorithm is known to be nearly 
optimal in this regard in the single channel context. This resuh illustrates that an optimal algorithm 
employed in a communication system having only one channel is not necessarily the optimal 
algorithm to employ in a multiple-channel communication system. 

Thus, since it is important when making a quality of service guarantee to clients to minimize 
the MaxStr^ the MAX and BASE algorithms perform better than the SRPT algorithm. It is also 
important to note that the MAX algorithm, which is an on-line algorithm, is more practical than the 
BASE algorithm, which is an off-line algorithm. 

While there has been shown and described particular embodiments of the invention, it will 
be obvious to those skilled in the art that changes and modifications can be invention, made therein 
without departing firom the invention, and therefore, the appended claims shall be understood to 
cover all such changes and modifications as fall within the true spirit and scope of the invention. 
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